OCFSP: self-supervised one-class classification approach using feature-slide prediction subtask for feature data
نویسندگان
چکیده
One-class classification (OCC) is a machine learning problem where training data has only one class. Recently, self-supervised OCC algorithms have been increasing attention. These train the model for pretext tasks and use error OCC. However, these are specialized images, applying them to feature not practical or appropriate such purpose. The motivation of this study apply data. For purpose, paper proposes an approach using feature-slide prediction (FSP) subtask (OCFSP). main originality FSP subtask, which first In particular, proposed method creates self-labeled dataset by generating additional vectors with slide original self-annotating as number slides. Such applied multi-class classifier predict Since learns from class, accuracy seen class higher relative unseen classes. Accordingly, could be made FSP. methods experimented imbalanced-learn, covtype, kddcup datasets. OCFSP shows fair few given. addition, relatively fast testing speed, unlike image Therefore, bottleneck considered memory size, difference between Source code uploaded at https://github.com/ToshiHayashi/OCFSP
منابع مشابه
Feature Extraction for One-Class Classification
Feature reduction is often an essential part of solving a classification task. One common approach for doing this, is Principal Component Analysis. There the low variance directions in the data are removed and the high variance directions are retained. It is hoped that these high variance directions contain information about the class differences. For one-class classification or novelty detecti...
متن کاملFeature selection using genetic algorithm for classification of schizophrenia using fMRI data
In this paper we propose a new method for classification of subjects into schizophrenia and control groups using functional magnetic resonance imaging (fMRI) data. In the preprocessing step, the number of fMRI time points is reduced using principal component analysis (PCA). Then, independent component analysis (ICA) is used for further data analysis. It estimates independent components (ICs) of...
متن کاملA Classification Method for E-mail Spam Using a Hybrid Approach for Feature Selection Optimization
Spam is an unwanted email that is harmful to communications around the world. Spam leads to a growing problem in a personal email, so it would be essential to detect it. Machine learning is very useful to solve this problem as it shows good results in order to learn all the requisite patterns for classification due to its adaptive existence. Nonetheless, in spam detection, there are a large num...
متن کاملSpatial Feature Extractions Using Supervised Fuzzy Classification
This paper emphasis on spatial feature extractions and selection techniques adopted in content based image retrieval that uses the visual content of a still image to search for similar images in large scale image databases, according to a user’s interest. The content based image retrieval problem is motivated by the need to search the exponentially increasing space of image databases efficientl...
متن کاملA Novel Approach to Feature Selection Using PageRank algorithm for Web Page Classification
In this paper, a novel filter-based approach is proposed using the PageRank algorithm to select the optimal subset of features as well as to compute their weights for web page classification. To evaluate the proposed approach multiple experiments are performed using accuracy score as the main criterion on four different datasets, namely WebKB, Reuters-R8, Reuters-R52, and 20NewsGroups. By analy...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Soft Computing
سال: 2022
ISSN: ['1433-7479', '1432-7643']
DOI: https://doi.org/10.1007/s00500-022-07414-z